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This report describes a search for associated production of W and Higgs bosons based on data 
corresponding to an integrated luminosity of C ~ 5.3 fb _1 collected with the DO detector at the 
Fermilab Tevatron pp Collider. Events containing a W — » tv candidate (with £ corresponding to c 
or fi) are selected in association with two or three reconstructed jets. One or two of the jets are 
required to be consistent with having evolved from a b quark. A multivariate discriminant technique 
is used to improve the separation of signal and backgrounds. Expected and observed upper limits 
are obtained for the product of the WH production cross section and branching ratios and reported 
in terms of ratios relative to the prediction of the standard model as a function of the mass of the 
Higgs boson (Mr)- The observed and expected 95% C.L. upper limits obtained for an assumed 
Mh = 115 GeV are, respectively, factors of 4.5 and 4.8 larger than the value predicted by the 
standard model. 

PACS numbers: 14.80.Bn,13.85.Rm 



I. INTRODUCTION 

In the standard model (SM) of particle physics, the 
masses of the weakly interacting W and Z gauge bosons 
are accommodated through the process of electroweak 
symmetry breaking, and the masses of fermions through 
their Yukawa couplings to the Higgs held. The search for 
the Higgs boson, whose mass Mh is not predicted by the 
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SM, is a test of this hypothesis and is a major compo- 
nent of the experimental programs at particle colliders. 
At the Fermilab Tevatron pp Collider, this search is car- 
ried out using multiple statistically independent search 
samples, each sensitive to different Higgs boson produc- 
tion processes and decay channels, providing increased 
sensitivity in the search for direct evidence for this SM 
mechanism [1, 2]. 

This paper presents an extended description of the pre- 
viously reported search [3] for SM Higgs boson produc- 
tion through the process pp — > WH, in which a Higgs 
boson is produced in association with a W boson. The 
search is based on data corresponding to an integrated 
luminosity C « 5.3 fb _1 collected with the DO detector 
at the Fermilab Tevatron pp Collider with a center-of- 
mass energy y/s = 1.96 TeV. The events are required 
to contain a W — > ev or W — > [iv candidate, thereby 



suppressing background from inclusive 6-jet production 
processes, and enhancing sensitivity to signal by several 
orders of magnitude. The event selection is also sensitive 
to W — >■ tv events with r decay into electrons or muons. 
The events are required to contain a H — > bb candidate 
because of large branching fraction for this decay in the 
M H region considered here (100 < M H < 150 GeV). 
The experimental signature is therefore a single isolated 
lepton, an imbalance in the measured transverse energy 
($t), an d either two or three jets, at least one of which is 
consistent with having been initiated by a b quark. The 
three-jet sample is included to provide additional sen- 
sitivity for WH events containing gluon radiation from 
the initial or final-state particles of the hard collision. 
The data are examined in separate search samples of dif- 
ferent sensitivity and a multivariate random forest tech- 
nique [4, 5] is applied to each sample, further enhancing 
sensitivity to signal. 

Direct searches for the process e + e~ — >• ZH at the 
CERN e+er Collider (LEP) experiments set the SM 
Higgs mass to Mh > 114.4 GeV [6]. In addition, a fit to 
electroweak precision measurements of the masses of the 
W boson and the top quark from both Tevatron and LEP 
experiments leads to an upper limit of Mh < 161 GeV for 
SM Higgs production at the 95% C.L. [7]. Both the CDF 
and DO Collaborations have extensively investigated the 
WH associated production mechanism [8-13], and a re- 
gion at larger 156 < Mh < 177 GeV has also been ex- 
cluded at 95% C.L. by direct searches for H -*• W + W~ 
decays [14]. Results from the CERN Large Hadron Col- 
lider (LHC) Collaborations [15, 16] also exclude regions 
at higher Mh > 127 GeV and indicate that the most 
interesting region for the search for the SM Higgs boson 
is the one where the sensitivity of the search discussed 
in this article is maximal. The analysis presented here 
is expected to be a highly sensitive channel in the mass 
range 100 < Mh < 135 GeV, and complements searches 
at the LHC which rely primarily on different SM Higgs 
production and decay mechanisms in this mass range. 



tracking and trigger systems were installed during the 
summer of 2006 and the data samples collected prior to 
and after this upgrade are referred to as pre- and post- 
upgrade samples in the following. 



A. Tracking Detectors 

The DO tracking system surrounds the interaction 
point and consists of an inner silicon microstrip tracker 
(SMT) followed by an outer central scintillating fiber 
tracker (CFT). Both the SMT and CFT are situated 
within a 2 T magnetic field provided by a superconduct- 
ing solenoidal coil surrounding the entire tracking system. 

The silicon microstrip tracker is used for tracking 
charged particles and reconstructing interaction and de- 
cay vertices. In the central region there are six barrel 
sections each comprising four detector layers. The bar- 
rel sections are interspersed and capped with disks com- 
posed of 12 double-sided silicon wedge detectors. The 
first and second detector layers of each barrel contain 
12 silicon modules and 24 modules are installed in the 
third and fourth detector layers. An additional inner 
layer was added to the silicon tracker system in 2006 
[19]. In the high \rj\ region on either side of the three 
disk-barrel assemblies there are three further radial disk 
sections (F-disks), and in the far- forward region, large- 
diameter disks (H-disks) provide tracking at larger |r/|. 
The tracks of particles with \rj\ < 1.7 are measured using 
the CFT and the barrel and F-disk sections of the SMT, 
whereas tracks for particles at larger \rj\ are reconstructed 
using the the F- and H-disks. 

The CFT comprises scintillating fibers (835 /xm in di- 
ameter) mounted on eight concentric support cylinders. 
The cylinders occupy the radial space from 20 to 52 cm 
from the center of the beam pipe. The two innermost 
cylinders are 1.66 m long whereas the outer six cylinders 
are 2.52 m long. The outer cylinder provides tracking 
coverage extending to \rj\ = 1.7. 



II. THE DO DETECTOR 



B. Calorimeters 



The main components of the DO detector used in this 
investigation are the tracking detectors, calorimeters, 
muon detectors, and the luminosity system. Protons and 
antiprotons interact close to the origin of the DO detector 
coordinate system, which is at the center of the detector. 
A right-handed Cartesian coordinate system is used with 
the positive z axis pointing along the nominal direction 
of the incoming proton beam (the positive y axis points 
toward the top of the detector) and the pseudorapidity 
variable, defined as 77 = — In tan |, where 8 is the polar 
angle in the corresponding spherical polar coordinate sys- 
tem. The kinematic properties of particles and jets are 
measured with respect to the reconstructed pp collision 
vertex. More details on DO construction and component 
design are available in Refs. [17, 18]. Upgrades to the 



The DO calorimeter system is used to measure energies 
as well as to identify electrons, photons, and jets. The 
calorimeter also helps to identify muons and provides a 
measure of the $t in events. The central calorimeter 
(CC) and the two end calorimeters (EC) are contained 
within three individual cryostats located outside of the 
superconducting solenoid. The central calorimeter covers 
detector pseudorapidities \r]\ < 1.1 and the end calorime- 
ters extend the range to \rj\ = 4.2. The active material in 
each calorimeter section is liquid argon. Extending radi- 
ally outwards from the detector center, the calorimeters 
are subdivided into electromagnetic (EM), fine hadronic, 
and coarse hadronic (CH) sections. The absorber mate- 
rial of the EM sections is uranium, whereas for the fine 
hadronic sections a uranium-niobium alloy is used. The 



CH absorbers are made of copper in the CC region and 
stainless steel in the EC region. To improve measure- 
ments in the intercryostat regions, plastic-scintillator de- 
tectors and "massless gap" detectors are used to sample 
showers between cryostats, enhancing calorimeter cover- 
age in the region 0.8 < \rj\ < 1.4. 



C. Muon Detectors 

The muon detector system [20] consists of a central 
muon detector system covering the range \r)\ < 1 and a 
forward muon system that covers the region 1 < \r]\ < 2. 
The central muon system comprises aluminum propor- 
tional drift chambers whereas aluminum mini drift tubes 
are used in the forward system. Scintillation counters 
are included for triggering purposes, and 1.8 T toroidal 
magnets make it possible to determine muon momenta 
and perform tracking measurements based on the muon 
system alone. 

The proportional drift chambers are arranged in three 
layers, one of which (A layer) is located within the toroid, 
with the remaining two (B and C) layers located beyond 
the toroid, with the C layer radially furthest from the 
interaction point. In the central muon system, the B and 
C layers have three planes of drift cells. The A layer has 
four planes, except at the support structure at the bot- 
tom of the detector, where the A layer has three planes 
of cells. In the forward region, mini drift tubes are ar- 
ranged in eight octants with four planes in the A layer 
while the B and C layers each have three planes. 



D. Luminosity System 

The DO luminosity system is used to determine the in- 
stantaneous luminosity and also to measure beam-halo 
rates. The system is composed of two disks of scintil- 
lating tile detectors that are positioned in front of the 
ECs on both sides of the DO detector at z = ±140 cm. 
Each of the disks consists of 24 plastic scintillation coun- 
ters that cover pseudorapidity regions 2.7 < |r;| < 4.4. 
The total integrated luminosity (£) is determined via the 
average instantaneous number of observed inelastic col- 
lisions (A?i ne i), according to fNi ne \/ai ne \, where / is the 
frequency of pp Tevatron bunch crossings, and <7i ne i is the 
effective inelastic production cross section [21] within the 
luminosity system acceptance, after taking into account 
beam-halo events and multiple collisions within a single 
beam crossing. In practice, N- mc \ is calculated by invert- 
ing the Poisson probability of observing no hits in either 
of the two disks [22] . 



III. TRIGGERING 

The DO trigger system has three levels referred to as 
LI, L2, and L3. Each consecutive level receives a lower 



rate of events for further examination. The L2 software- 
based algorithms refine the LI information they receive 
and the L3 software-based algorithms then run simplified 
versions of offline identification algorithms based on the 
full detector readout. 

The W —¥ ev candidates of this search are collected 
using the logical OR [23] of different triggers requiring a 
candidate electromagnetic object. The LI electron trig- 
gers require calorimeter energy signatures consistent with 
those of an electron. The logical OR also includes trigger 
algorithms requiring an electromagnetic object together 
with at least one jet, for which the LI requirement in- 
cludes a calorimeter energy deposition expected for jets 
at large transverse momenta px- The triggers have dif- 
ferent minimum electron and jet pt thresholds, and each 
has a typical efficiency of (90-100)% for the signal events 
satisfying the selection requirements discussed below, de- 
pending on the trigger type and sampled region of the de- 
tector. The trigger efficiencies are determined using sam- 
ples of Z/j* — > e + e~ events and are modeled as functions 
of the pt and rj of the leading (largest pr) electromag- 
netic object in the event. Event weights are used to apply 
the measured trigger efficiencies to the simulated signal 
and background samples. Since the triggers undergo peri- 
odic changes, these efficiencies depend on specific running 
periods. In particular, an improved calorimeter trigger 
was added during the 2006 detector upgrade [24]. 

W — *• [xv candidates are triggered using the logical OR 
of the full set of available triggers and expected to be fully 
efficient for the selection criteria used. For muons, the se- 
lected pseudorapidity range of this analysis is \r\\ < 1.6, 
where the majority of the events are collected by triggers 
that require a large-py muon at LI (single-muon trig- 
gers). The efficiency of the single-muon triggered com- 
ponent of the data is determined using Z/j* — > /i + /i~ 
events, again separately for specific running periods. It 
is typically w 70% and is well modeled in simulation. 
The remainder of the events are collected primarily us- 
ing jet triggers. The efficiency for these triggers is deter- 
mined separately by taking the ratio of this component of 
the triggered data set to Monte Carlo (MC) simulation 
with triggering probabilities set to unity [after correct- 
ing the data for multijet (MJ) background as described 
separately in Sec. VII]. The ratio is parameterized as a 
function of the scalar sum (H?) of the transverse mo- 
menta of the jets in the event, and compared to the well- 
modeled single-muon triggered data set. The simulated 
probability for events to pass at least one of the single 
high-pT muon triggers is then scaled to the efficiency of 
the complete set of triggers used. The most recently col- 
lected data correspond to the highest instantaneous lu- 
minosities, and because different proportions of multijet, 
-^T+jet, and muon+jet triggered events are observed as 
a function of luminosity, the additional probability factor 
is computed separately for events collected before and af- 
ter the 2006 DO upgrade. The remaining triggers provide 
a gain in probability of s=s 0.23 before the 2006 upgrade 
and range from 0.23-0.33 following the upgrade. 



After additional detector status quality requirements, 
applied to ensure subdetector systems are operational, 
the total integrated luminosity is C — 5.32 fb _1 for the 
electron channel and L = 5.36 fb _1 for the muon chan- 
nel. The contribution to the total integrated luminosity 
from the pre-2006 upgrade part of the data set is about 
1 fb _1 in each case. The uncertainty on the experimen- 
tally measured integrated luminosity is 6.1% [22] and is 
dominated by the uncertainity in the effective inelastic 
production cross section [21]. 



IV. IDENTIFICATION OF LEPTONS, $ T , 
AND JETS 



Candidate events with W bosons are selected by re- 
quiring a single reconstructed lepton together with large 
$t and the selected W — > iv samples are also required 
to contain either two or three reconstructed jets. 

Electrons of px > 15 GeV are reconstructed in the 
CC or EC calorimeters in the pseudorapidity regions 
\r/\ < 1.1 and 1.5 < \r}\ < 2.5, respectively. In the 
CC (EC), a shower is required to deposit 97%(90%) 
of its total energy [as measured in a cone of radius 
AR = ^J(Arj) 2 + (A(f>) 2 = 0.4] within a cone of radius 
AR = 0.2 in the electromagnetic layers. The showers 
must have transverse and longitudinal distributions that 
are consistent with those expected from electrons. In 
the CC region, a reconstructed track, isolated from other 
tracks, is required to have a trajectory that extrapolates 
to the EM shower. The isolation criteria restricts the sum 
of the scalar pr of tracks o f px > 0-5 GeV within a hol- 
low cone of radius AR = ^{Arf) 2 + (A</>) 2 = [0.05 - 0.4] 
surrounding the electron candidate to < 2.5 GeV. Ad- 
ditional information on the number and scalar pj- sum 
of tracks in cone of radius AR = 0.4 surrounding the 
candidate cluster, track to cluster matching probability, 
the ratio of the transverse energy of the cluster and the 
transverse momentum of the track associated with the 
shower, the EM fraction, and lateral and longitudinal 
shower shape characteristics are used to construct CC 
and EC electron likelihood discriminants. The discrim- 
inants are trained using Z/-y* — *• e + e~ events and are 
applied to ensure that the observed particle characteris- 
tics are consistent with electrons [25]. 

Muons of pt > 15 GeV are selected in the region 
1 77 1 < 1.6. Muons are required to have track segments 
in both the A and B or C layers of the muon detec- 
tors, with a spatial match to a corresponding track in 
the central tracker. The scalar sum of the px of tracks 
with AR < 0.5 around the muon candidate is required 
to be less than 2.5 GeV. Furthermore, transverse en- 
ergy deposits in the calorimeter in a hollow cone of 
AR = [0.1 — 0.4] around the muon must be less than 
2.5 GeV. To suppress MJ background events that orig- 
inate from semileptonic decays of hadrons, muon candi- 
date tracks are required to be spatially separated from 
jets by AR(n,j) > 0.5. To suppress cosmic-ray muons, 



scintillator timing information is used to require the hits 
to coincide with a beam crossing. 

In addition to the selection criteria listed above, elec- 
trons and muon samples are also selected using much 
looser reconstruction criteria. For the electron channel, 
less restrictive calorimeter isolation and EM energy frac- 
tion criteria are used and the likelihood discriminants are 
not applied. For the muon channel, less restrictive energy 
isolation and track-momentum criteria are used. These 
samples are used only for the determination of the MJ 
background contributions to the final selected samples as 
described in Sec. VII. 

The $t is calculated from individual calorimeter cell 
energies in the electromagnetic and fine hadronic parts 
of the calorimeter and is required to be $t > 20 GeV 
for both the electron and muon channels. It is corrected 
for the presence of any muons. All energy corrections 
to leptons and to jets (including energy from the coarse 
hadronic layers associated with jets) are propagated to 
the fa. 

Jets are reconstructed in the calorimeters in the re- 
gion I r\ I < 2.5 using the DO Run II iterative midpoint 
cone algorithm, from energy deposits within cones of size 
AR = 0.5 [26]. To minimize the possibility that jets are 
caused by noise or spurious energy deposits, the fraction 
of the total jet energy contained in the EM layers of the 
calorimeter is required to be between 5% and 95%, and 
the energy fraction in the CH sections is required to be 
less than 40%. To suppress noise, different cell energy 
thresholds are also applied to clustered and to isolated 
cells. The energy of the jets is scaled by applying a cor- 
rection determined from 7+jet events using the same jet 
finding algorithm. This scale correction accounts for ad- 
ditional energy (e.g., residual energy from previous bunch 
crossings and energy from multiple pp interactions) that 
is sampled within the finite cone size, the calorimeter 
response to particles produced within the jet cone, and 
energy flowing outside the cone or moving into the cone 
via detector effects (e.g the deflection of particles by the 
magnetic field). Details of the DO jet energy scale cor- 
rection can be found in Ref. [27]. 

In addition to the previously mentioned jet energy 
scale correction, derived using 7+jet events, residual cal- 
ibration differences between simulated and data-selected 
jets are also studied using Z{— > e + e~)+jet events. An 
additional energy recalibration and an energy smearing 
are then determined to adjust the pt imbalance between 
the Z boson and the recoiling jet in simulation to that 
observed in data. The correction is applied in simulation 
to gluon-dominated jet production processes. Differences 
in reconstruction thresholds in simulation and data are 
also taken into account. 

The jet identification efficiency and jet resolution are 
adjusted in the simulation to match those measured in 
data. Following the 2006 upgrade of the DO detector 
to handle higher instantaneous luminosity, all jets are 
also required to satisfy additional criteria for originating 
from the primary pp vertex ("vertex confirmation"). The 



criteria are that the jets have at least two tracks, each of 
which have p? > 0.5 GeV, at least one hit in the SMT 
detector, and distances of closest approach (DCA) of < 
0.5 and < 1.0 cm from the primary pp interaction vertex 
in the transverse plane and along the z axis, respectively. 



V. TAGGING OF b QUARK JETS 

The final sample of WH candidate events is selected 
by requiring that at least one of the jets produced in 
association with the W boson is consistent with having 
been initiated by a b quark, using the neural network 
(NN) ^-tagging algorithm described in detail in Ref. [28] . 

Jets considered by the 6-tagging algorithm are required 
to pass a "taggability" requirement that utilizes charged- 
particle tracking and vertexing information. The effi- 
ciency of this requirement accounts for variations in de- 
tector acceptance and track reconstruction efficiencies at 
different z position values of the primary vertex (PV) 
through the interaction region, prior to the application 
of the 6-tagging algorithm, and depends on the z posi- 
tion of the PV and the r\ of the jet. More details on the 
reconstruction and selection of the primary interaction 
vertex are available in Ref. [28] . The taggability require- 
ment is that a calorimeter jet be matched to a track-jet 
within an angular separation of AR < 0.5. Track-jets 
are formed starting from seed tracks of pt > 1 GeV with 
at least one hit in the SMT detector and DCA require- 
ments of < 0.15 cm and < 0.4 cm to the primary ver- 
tex in the transverse plane and along the z axis, respec- 
tively. The other tracks used to form track-jets must have 
Pt > 0.5 GeV. To reduce the probability of misidenti- 
fied secondary vertices, tracks consistent with the decay 
of a long lived particle (e.g. K s , A) or a converted pho- 
ton are also removed, before application of the 6-tagging 
algorithm. 

The efficiency of the taggability requirement in the se- 
lected samples is studied in data and in simulation, in 
four z-vertex intervals, as a function of jet r/ and pt- 
Correction factors are determined and applied to the MC 
separately for the pre- and post- upgrade parts of the sim- 
ulated samples. The corrections, which are of order 1%, 
are applied as a function of jet 77 (post-upgrade) and jet 
Pt (pre-upgrade) . The jet taggability efficiency is largest 
(80%-90%) around the center of the interaction region. 
More details on jet taggability and its efficiency can be 
found in Ref. [28]. 

The NN 6-tagging algorithm uses seven input variables, 
five of which make use of secondary decay vertex infor- 
mation. These are the invariant masses (calculated from 
all contributing tracks assuming the pion mass) of sec- 
ondary vertices, the number of tracks used to reconstruct 
the secondary vertex, the x 2 of the secondary vertex fit, 
the decay length significance of the secondary vertex with 
respect to the primary vertex in the transverse plane, 
and the number of secondary vertices reconstructed in 
the jet. Two further impact-parameter-based variables 



are also used. The first is a discrete signed impact pa- 
rameter significance variable, which is a combination of 
four quantities related to the number and the quality of 
tracks within a cone of radius A_R = 0.5 centered on the 
calorimeter jet. The second is a continuous jet-lifetime 
variable, which is used to assign a total probability that 
tracks within a jet are consistent with the primary vertex 
position. The variable is calculated using the product of 
individual track probabilities, which indicate the likeli- 
hood that each track is consistent with the primary ver- 
tex position. The individual probabilities are based on 
the impact parameter resolution of the tracks. The track 
impact parameters are given the same sign as the scalar 
product of the track DCA in the transverse plane and the 
jet Pt- The negative signed region is used to calibrate the 
impact parameter resolution whereas tracks with positive 
values are used to calculate the total lifetime probability. 



VI. MONTE CARLO SIMULATION 

At each step of the selection, the data are compared 
to predictions obtained by combining the MC simulation 
of SM backgrounds with a data-based estimation of the 
instrumental background from MJ events containing 
misidentified leptons (discussed separately in Sec. VII). 
All generated samples are passed through a detailed, 
GEANT-based simulation [29] of the DO detector and the 
same reconstruction algorithms used for data. Separate 
simulations are applied for conditions prior to and after 
the 2006 detector upgrade. The SM predictions are used 
to set the relative normalizations for all of the generated 
samples, and additional reweighting factors are then 
applied to normalize samples generated using the leading 
order (LO) ALPGEN [30] MC event generator to data. 
These factors are determined prior to the application 
of 6-tagging (see Sec. V), where the signal contribution 
is expected to be negligible, and these are determined 
simultaneously with the MJ background, which is also 
obtained from data. The impact of multiple pp inter- 
actions and detector noise is accounted for by adding 
data events recorded during random beam crossings 
to the simulated events before they are reconstructed. 
The instantaneous luminosity profile of these events is 
matched, prior to the application of fe-tagging, to that 
observed in the selected data samples. For all the MC 
samples the effects of beam remnants and of multiple 
partonic interactions (underlying event) are modeled 
using the PYTHIA parameters obtained from data in Ref. 
[31]. 

• WH production: The WH associated production 
process, with subsequent decay of the Higgs boson to a 
bb quark-antiquark pair, is modeled using the pythia 
[32] MC event generator, according to the prescription 
of Refs. [33-37] and the LH2003 Working Group [38]. 
The events are generated using the CTEQ6L LO parton 
distribution functions [391 with the renormalization and 



factorization scales set to the Higgs boson mass Mh- 

Eleven samples in total are generated, for Mh values 
spanning the range Mh — 100 — 150 GeV in intervals 
of 5 GeV. Similarly, a set of 11 qq — )• ZH signal 
samples is also generated with pythia to model the 
small contribution of signal events from ZH associated 
production that passes all selections. These events are 
selected if one of the leptons from the decay Z — > £ + £~ 
is either not reconstructed or is produced outside of the 
detector acceptance. The WH and ZH samples are 
referred to collectively as WH in the figures and the 
remainder of the text. 

• VF+light partons: The SM background processes 
W{— > £f)qq, where q represents light quarks (u, d, s) and 
gluons are generated using the LO MC matrix element 
event generator ALPGEN according to the parton-level 
cross section calculations of Ref. [39] . Separate samples 
are generated for light parton multiplicities 0,1,2,3,4, 
and > 5 with each case generated for each of the 
final-state decay lepton flavors £ = e, fi, and r. The 
pythia generator is used to account for the subsequent 
hadronization and development of partonic showers. 
The MLM factorization ("matching") scheme [40] is used 
to avoid the possibility of overestimating the probability 
of further partonic emissions produced in pythia. The 
samples are then normalized to data as described in 
Sec. VI A. To avoid double counting of heavy quarks, 
Wbb and Wcc events, which are generated separately as 
described below, are removed. 

• Z/7*+light partons: A corresponding set of 
Z/j*(— > ££)qq samples are generated for light-parton 
multiplicities 0, 1, 2, and > 3. These samples also 
include each of the lepton flavors £ = e, fj,, and r. 
The Z/j* contributions are generated over the Z/"f* 
mass region M« — 15 — 250 GeV for £ = e,/i, and 
M u = 75 - 250 GeV for t decays. The combined 
W(-^? £v)qq and Z/'y*(—t £€)qq samples are referred to 
as W+light in the figures and the remainder of the text. 

• Wbb, Wcc: The channel W(-t £v)bb and also 
the channel W(—^ £v)cc (referred to collectively as 
Wbb) are generated using ALPGEN also according to 
the initial prescription of Ref. [41]. The PYTHIA gen- 
erator is again used to account for subsequent shower 
development and the MLM matching scheme is again 
used for the treatment of further partonic emissions. 
Four separate samples are generated for 0, 1, 2, and > 3 
additional light partons. To avoid double counting, 
Wcc states are removed from the W{— > £v)bb samples, 
and no events are removed from the W(— > £v)cc samples. 

• Z/~f*bb, Z/-f*cc: Corresponding samples of 
Z/'y*(—t ££)bb and Z/'y*(—t U)cc events are generated 
for each lepton flavor £ — e,/j,,T and for 0, 1, and 
> 2 additional light parton multiplicities. The com- 
bined W(-+ £v)bb, W(-+ £v)cc, Z/7*H M)bb, and 



Z/j*(-^ U)cc samples are referred to as Wbb in the 
figures and the remainder of the text. 

• ti: The background from ti interactions is generated 
using ALPGEN, again interfaced with pythia, and 
using the MLM matching scheme. The cross section 
predictions contain the most important terms of the 
next-to-NLO (next-to-next-to-LO) corrections [42]. The 
ti —¥ bb + £ + v£ _ Vfi and ti —¥ bb + 2j + iv final states are 
considered, including 0, 1, and 2 additional light parton 
multiplicities, and all decay lepton flavors £ — e,/j,,T. 

• Single top quarks: Background processes ini- 
tiated by single top quark production are generated 
using CompHep [43, 44]. The cross sections [45] are 
calculated at NLO and pythia is again used for subse- 
quent hadronization and partonic-shower development, 
along with the MLM matching scheme. The s-channel 
(tb — > £vbb) processes and i-channel (tqb —J- Ivbqb) 
processes are generated for the three lepton flavors 
I = e, fi, and r. The single top samples are referred to 
collectively as s — top in the figures and the remainder 
of the text. 

• Diboson: Backgrounds from the hadronic produc- 
tion of diboson pairs ( pp — > V1V2, where Vi, V2 = W ± . 
or Z/j*) are simulated using pythia. The cross sections 
are calculated at NLO according to the prescription of 
Ref. [46], obtained using the mcfm program, and incor- 
porating spin correlations in partonic matrix elements. 
The diboson samples are generated inclusively for all bo- 
son decay leptonic flavors £ = e, /j,,t and are referred to 
collectively as WZ in the figures and remainder of the 
text. 



A. MC Reweighting 

Because of problems in the modeling of background 
processes in MC simulations, we apply corrections sum- 
marized in the following. Distributions of the summed 
VF+light and Wbb simulated samples are compared to 
data, prior to the application of 6-tagging, and correc- 
tions are developed to reweight shape discrepancies. The 
correction factors are calculated, prior to the determina- 
tion of the ALPGEN normalization factors. These correc- 
tions are motivated by previous comparisons of ALPGEN 
with data [47] and with other event generators [40]. The 
overall event yields are preserved in the reweighting, and 
the same weight functions are applied to all the TT+jets 
ALPGEN backgrounds, at reconstruction level in the MC. 
In this section, we describe the applied reweighting func- 
tions in detail. 

The reweighting functions are determined from the ra- 
tio of the total VF+light and Wbb distributions to the cor- 
responding distributions obtained from the high statistics 
selected Nyy+\ e ts component of the data. The expected 
signal contribution is negligible in this sample. N^^- ts 



is obtained after correcting the total selected data sample 
-^data f° r MJ background (TVmj) and the total expected 
contributions from other SM background sources (-/Vg^) 1 



N W+)cts — ^data 



N- 



MJ 



JVc 



MC 



SM 



(1) 



Prior to determining the ALPGEN reweightings, cali- 
bration differences in data and MC for overlaid events 
in the post-2006 upgrade samples are corrected for. Two 
reweighting constants are applied which scale down the 
MC, for the leading and subleading jet \rj\ distributions, 
thereby improving detector modeling in the intercryo- 
stat pseudorapidity region 0.8 < \rj\ < 1.4. Separate 
constants are used for the positive and negative pseudo- 
rapidity intervals, for each of the two leading jets. The 
reweighting constants reduce the simulated contribution 
by 1%-10%. 

The overall description of the N^^- ts lepton i] distri- 
bution as well as the corresponding leading and sublead- 
ing jet r\ distributions are then adjusted by applying a 
first-order polynomial reweighting function in i] 2 of the 
simulated lepton r\ and second-order reweighting func- 
tions in rj 2 to the r\ of the leading and subleading jets. 
Only the two leading jets are reweighted in the W+3 jet 
selections. These reweighting functions have the primary 
effect of improving the ALPGEN description of the r\ dis- 
tributions by increasing the MC component for \r\\ > 1.5. 

Discrepancies observed in the correlation between the 
jet directions A_R(ji j'2) and the W boson px are cor- 
rected through two reweighting functions in the two- 
dimensional AR—p^ plane. The functional form is a 
third-order polynomial in AR(ji j'2), increasing the ALP- 
GEN simulation by w 20% at large AR, times a con- 
stant plus Gaussian error function reweighting in W bo- 
son pr, applied to the W boson ALPGEN samples only, 
and which primarily increases the simulation by w 20% 
for p^ < 20 GeV. The pr distribution for Z/j* produc- 
tion is also adjusted to agree with the observed distribu- 
tion. The systematic uncertainties associated with these 
reweightings are discussed in Sec. X. 



B. alpgen Normalization Factors 

Two multiplicative scaling factors are used to normal- 
ize and to incorporate the effects of higher-order terms 
in the ALPGEN MC samples. The first factor, K w+jcts , is 
applied to both the W + light parton and Wbb generated 
events, whereas the second multiplicative factor, S wb i, is 
applied only to the Wbb samples. 

To determine K w +i cts i the number of selected W + 
light parton and Wbb events in alpgen {N^_- ts ) is 
scaled to match the data (Nyy+\ e t s ) contribution: 



TABLE I: The experimental K w+ jcts factors (applied after 
taking into account the theoretical factor of 1.3) and the 
S wb j, heavy flavor factors in a zero fe-tagged sample (after 
accounting for the theoretical heavy flavor K factor of 1.47 
for W+ jet). The errors shown are statistical errors only. 
The systematic uncertainty on the K w +i etB (S wb i) factors is 
6% (20%) as described in Sec. X. 



rs-W+2 jets Ty-W+3 jets q 



II •'■'"■ 



Pre-2006 e 1.10 ± 0.01 1.21 ± 0.03 0.78 ± 0.09 

H 1.16 ± 0.01 1.35 ± 0.03 0.99 ± 0.11 

Post-2006 e 1.05 ± 0.01 1.12 ± 0.01 1.14 ± 0.06 

fj, 1.10 ± 0.01 1.21 ± 0.01 1.02 ± 0.06 



The factors K w+ ^ cts are calculated separately for the 
electron and muon channel samples and separately for 
both the TT+2 jet and W+3 jet selections. The obtained 
factors are found to be consistent within their statisti- 
cal and systematic uncertainties and are shown, after ac- 
counting for NLO corrections to the cross section [46] 
(already included in the generated samples), in Table I. 
The values are in the range K w+,cts s=s 1.0 — 1.16 for the 
W+2 jet and K w +i ets « 1.12 - 1.35 for the W+3 jet 
selected samples. The assigned systematic uncertainties 
are described in Sec. X. 

As indicated above, the factor S W bl IS applied addi- 
tionally only to the Wbb heavy parton events: 



N- 



MC 
W+jets 



N, 



MC 
W+light 



o wMC 

D Wbb Iy Wbb 



(3) 



[the same factor is used for the W(— *• lv)bb and W(— > 
lv)cc generated samples and for the corresponding Z/"f* 
heavy flavor samples]. The heavy flavor factor S wbb - is 
extracted by requiring either zero, one, or two 6-tags (see 



tag, data 
W+jets 



events, 



Sec. V) to obtain samples containing N- 

however it is dominated by the single 6-tag samples that 

have the smallest expected signal contribution. The num- 



ber of predicted TT+jet events, N w °?- ts , in the tagged 
samples, after application of the scaling factor x w+ ' ,cts , 
is given by N%$£ = K^+^N^, and the heavy 
flavor contribution can therefore be extracted from 
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The heavy flavor scale factors, determined using sam- 
ples requiring zero 6-tagged jets are also shown in Table 
I. The factors are applied separately for the electron and 
muon channel samples and also for data before and after 
the DO upgrade. The luminosity weighted average of the 
factors is found to be consistent with the theoretically 
expected value [46]. 



VII. MULTIJET BACKGROUND 
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JV W+jets 
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The total MJ background contribution entering each 
of the final selected signal samples is determined from 
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FIG. 1: [color online] The probability for MJ background 
events to enter the final e + 2 jet (post-2006 upgrade) sample. 
The solid, dashed, and dotted curves represent the result of 
fitted parameterizations in each interval. 



FIG. 2: [color online] The probability for MJ background 
events to enter the final fi + 2 jet (post-2006 upgrade) sample. 
The solid and dashed curves represent the result of the fitted 
parameterizations in each interval. 



the data, prior to the application of 6-tagging, using the 
prescription of Ref. [25]. The MJ contributions are de- 
termined in conjunction with the previously described 
ALPGEN normalization factors. Multijet templates are 
obtained from control samples in the data and normal- 
ized through a \ 2 fit to the W boson transverse mass 
distribution. For the determination of the MJ contribu- 
tion, the ALPGEN normalization factors described in Sees. 
VI A and VI B are varied in conjunction with the MJ 
normalization, such that the total number of predicted 
MC and MJ events agrees with the total number of data 
events prior to the application of 6-tagging. 

For both the electron and muon selected events, addi- 
tional data samples selected with the much looser lepton- 
identification criteria (see Sec. IV) are used. Events en- 
tering the looser samples (L) are a combination of true 
leptonic events and MJ background in which a jet is 
misidentified as a lepton. Upon application of the tighter 
(T) final selection criteria the remaining contributions of 
true leptonic and background events depend upon the 
(relative) efficiency e£ T for true leptons to subsequently 
pass the final selection criteria, and the probability P^ 
that MJ background events in the looser sample subse- 
quently enter the tighter, final signal samples. A weight 
wi is assigned to each event i in the looser selected sam- 
ples according to 
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where 0* = 1 if the event i in the loose sample passes 
the tight selection requirements and is zero otherwise. 
The total MJ background contributions in the final sig- 
nal samples are given by the sum of the event weights 



Wi in the corresponding loose samples. The efficiencies 
£ lt i are functions of lepton px and are determined from 
Z/-f* -> l + l~ events. The probabilities P^i are deter- 
mined from the measured ratio of the number of events 
in the final to loosely selected samples after correcting 
each sample for the expected MC contribution from the 
leptons in the specific kinematic interval. For both the 
final electron and muon samples, the probability for MJ 
events to enter the final selected samples is extracted in 
the region 5 < $t < 15 GeV [and without applying the 
additional requirement on Tfix in Eq. 7 of Sec. VIII]. 



A. Parameterization of the Misidentified Jet 
Probability 

The measured probability for MJ events to enter the 
final electron plus two-jet selection sample is shown as a 
function of electron px in Fig. 1. The MJ contribution in 
the electron channel arises from jets with a high enough 
fraction of energy deposited within the EM section of 
the calorimeter that they satisfy the electron identifica- 
tion criteria. Additional MJ backgrounds in the elec- 
tron channel originate from the semileptonic decays of 
hadrons and from photons that are misidentified as elec- 
trons. The probability is measured separately in two CC 
regions (|?/| < 0.7 and 0.7 < |?7| < 1.1) and separately in 
the EC region (1.5 < \r)\ < 2.5). In each range of \rj\, the 
misidentified jet probability is parameterized as a func- 
tion of electron px in four intervals of the azimuthal sep- 
aration A(j>($T, e) of the electron and the Jfix vector (the 
four regions are shown combined for each |tj| interval in 
Fig. 1). In the CC region, the probabilities are parame- 
terized as sums of exponentials and first-order polynomi- 
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FIG. 3: [color online] Comparison of simulated events, including data-determined MJ background, to the W+2 jet selected 
data (black points) for (a) isolated lepton px, (b) missing event transverse energy Et, (c) transverse mass of the (I, fir) system, 
(d) pr of W boson candidates, (e) leading jet pr, (f) Pt of the second leading jet, (g) scalar sum of the pt of jets in the event 
(Ht), (h) transverse momentum of the dijet system, (i) separation A_R, and (j) azimuthal separation A(f> for the two jets. The 
expectation for a WH signal at Mh — 115 GeV has been scaled up by a factor of 300. The electron and muon selected samples 
are combined in the figures. 
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FIG. 4: [color online] Dijet invariant mass distribution for the W+2 jet selected samples on linear and logarithmic scales for 
(a), (d) no 6-tagging, (b), (e) events that contain two b-tagged jets, and (c), (f) events that fail the two-tagged requirement but 
contain a single NN 6-tagged jet. The expectation for a W H signal at Mh = 115 GeV has been scaled up by a factor of 10. 



als in electron px, whereas only a first-order polynomial 
in electron pt is used in the smaller statistics EC region. 
For the smaller statistics electron W+3 jet sample, the 
probabilities are determined once for each \rj\ region, and 
are applied to each A0(^?T,e) interval separately after 
scaling to the average contribution obtained in that in- 
terval. 

The measured probability for MJ background events 
to enter the final /i + 2jet sample is shown as a function 
of muon \rj\ in Fig. 2. The primary source of MJ back- 
ground in the muon channel is from semileptonic decays 
of heavy quarks in which the decay muon satisfies the 
muon isolation criteria. The contribution of MJ events 
entering the loose sample is smaller in the muon channel 
than in the electron channel. Consequently, the misiden- 
tified jet probability is parameterized in only two regions 
[\A<f)(]£ T ,n)\ < tt/2 and tt/2 < |A0(# T ,/x)| < vr] of az- 
imuthal separation A4>(^t > A*) between the muon px and 
the $t vectors. In both pre-2006 and post-2006 upgrade 
data, the misidentification probability is parameterized 
using a third-order polynomial in muon |7y| 2 . The same 
functions are applied to both the muon W+2 jet and 
VF+3 jet selected samples. 



VIII. EVENT SELECTION 

This section describes the selection of data samples 
containing events with a single reconstructed lepton, $t, 
and either two or three jets of transverse momentum 
Pt > 20 GeV, at least one of which is required to be 
consistent with having evolved from a b quark. The sam- 
ples are from data collected between 2002 and June 2009 
at y/s — 1.96 TeV. Candidate W bosons are selected 
by requiring an electron or a muon with transverse mo- 
menta pr > 15 GeV and $t > 20 GeV. Electrons are 
required to be in the pseudorapidity region \r)\ < 1.1 or 
1-5 < \r]\ < 2.5 and unions in the region \t)\ < 1.6. The 
selected W — > ev and W — > [iv candidate events are 
divided into samples containing exactly two or exactly 
three reconstructed jets. Jets are required to be in the re- 
gion | r\ | < 2.5. A selection on the Ht of the jets, Ht > 60 
and > 80 GeV, is also applied to the W+2 jet and W+3 
jet samples, respectively, and the event PV is required to 
be reconstructed within zpv = ±40 cm of the center of 
the detector. At least three charged tracks are required 
to be associated with that vertex. 

Distributions of lepton pt and $t are compared to 
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FIG. 5: [color online] Dijet invariant mass distribution for the VK+3 jet selected samples on linear and logarithmic scales for 
(a), (d) no 6-tagging, (b), (e) events that contain two 6-tagged jets, and (c), (f) events that fail the two-tagged requirement but 
contain a single NN 6-tagged jet (the two leading jets in the VK+3 jet samples are used to form the dijet invariant mass). The 
expectation for a WH signal at Mh = 115 GeV has been scaled up by a factor of 10. 



the sum of the expected SM background contributions 
and data-determined MJ background for the W+2 jet 
selected sample, which has the largest statistics of all 
selected samples, in Figs. 3(a) and (b). The electron and 
muon decay channel samples are combined in the figures, 
and all corrections to the background simulations have 
been applied. Details of the background estimates are 
given in Sees. VI and VII. 

To suppress Z/~f* — > £ + £~ and tt background events 
and to avoid double counting events in Higgs searches 
based on dilepton final states, events with additional elec- 
trons or muons isolated from jets that pass p^ > 20 GeV 
and p^p > 15 GeV are rejected. Events containing iso- 
lated high-pT t leptons that decay hadronically are also 
rejected by requiring pJp < 10 GeV or p r T < 15 GeV, 
depending on the r decay channel [48]. 

The transverse mass of the W boson candidates (Mjf ) 
is reconstructed from the (£,$t) system using the lepton 
transverse energy (Ej,), $t, and the azimuthal separa- 
tion A(/)(£,]£t) between the isolated lepton and the $t 
vector: 



The distribution of Mj! for selected W boson candidates 
is shown in Fig. 3(c). In addition to the dominant con- 
tribution from events with real W boson decays, there is 
a significant component from MJ events that contributes 
mainly at small values of My. Consequently the lower 
signal-to-background region at low J£t is rejected by re- 
quiring 



M^ > 40(GeV) 



0.5$ T - 



(7) 



M^ 



[2E^E T [l - cosA())(£,Et)}} 



(6) 



The p^ distribution for the W boson candidates is 
compared to the sum of the expected SM and MJ back- 
ground contributions, prior to the requirement in Eq. 7, 
in Fig. 3(d). 

Kinematic jet properties for the selected W+2 jet sam- 
ple are also compared to the sum of the expected SM 
background contributions, including MJ background, in 
Fig. 3. The corrected electron and muon channel samples 
are combined in the figure. The background prediction 
provides an adequate description of the data for all the 
distributions. 

To increase the final sensitivity, both the W+2 jet 
and W+3 jet samples are subdivided into statistically 
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FIG. 6: [color online] Comparison of the expected backgrounds to the two-6-tagged jet data sample in W+2 jet selected events. 
The expectation for a W H signal at Mh = 115 GeV has been scaled up by a factor of 10. 



15 



independent samples based on whether one or two of 
the leading jets in the event are consistent with having 
been initiated by a b quark, as discussed in Sec. V. The 
first sample requires two jets, both with NN output val- 
ues larger than a "loose" requirement ("loose-tag"). The 
second sample, selected from events that fail the two- 
tag requirement, requires a single jet with a NN output 
above a larger "tight" value requirement ("tight-tag"). In 
two-6-tagged jet events, the typical efficiency for iden- 
tifying a pt — 50 GeV jet that contains a b hadron is 
(59 ± 1)% with a misidentification probability of 1.5% 
for light parton (u,d,s,g) initiated jets. In the single-6- 
tagged jet event sample, the typical efficiency for iden- 
tifying a pt = 50 GeV jet that contains a b hadron is 
(48 ±1)%, with a lower misidentification probability of 
0.5% for light parton (u,d,s,g) initiated jets. The b- 
tagging efficiency is treated separately from the jet tag- 
gability efficiency. Events that do not satisfy either of 
these tagging requirements are not considered further in 
the analysis. 

The tagging efficiencies for jets that have passed the 
taggability requirements are studied in data and the effi- 
ciencies are applied to the simulation via event weights. 
These weights depend on the px, f], and partonic flavor 
of each tagged jet. In two 6-tagged events, the event 
weights are given by the product of the weights of the 
two 6-tagged jets. In single- 6-tagged jet events, the final 
event weight also accounts for the simulated contribution 
of two-6-tagged jet events that "migrate" to the simulated 
single-6-tagged jet samples. 

Distributions of dijet invariant mass, prior to b- 
tagging, after requiring two 6-tags, and for single-6- 
tagged events, are shown for the W+2 jet and W+3 jet 
selections in Figs. 4 and 5, respectively. The sums of 
the expected backgrounds are compared to the data, and 
the electron and muon channel samples are again shown 
combined in each figure. Comparisons of kinematic prop- 
erties in W+2 jet events are shown in Figs. 6 and 7 for 
the two- and single-6-tagged samples, respectively. The 
expected signal contribution at Mh = 115 GeV is shown 
scaled by a factor of 10 in each figure. 

The total event yields for each of the ^-tagged samples, 
in data and in simulation, are summarized in Table II. 
In two-&-tagged jet events, the dominant backgrounds are 
from Wbb and tt processes. In single-fo-tagged jet events, 
the dominant backgrounds are W boson production in 
association with light or c-quark jets as well as tt pro- 
duction and MJ events. The expected number of signal 
events in each sample is listed for an assumed Higgs mass 
Mh = 115 GeV. The uncertainties quoted are the com- 
bined statistical and systematic uncertainties, and the 
systematic uncertainties are those prior to the applica- 
tion of the fitting procedure applied when determining 
cross section upper limits described in Sec. XL 



TABLE II: Event yields for the W+2 jet and W+3 jet 
samples after requiring two fe-tagged jets or a single fc-tagged 
jet in the events. The expected contributions to the total 
background from the simulated IT+light, data-derived MJ, 
and simulated Wbb, tt, single top quark, and W Z diboson 
samples are also listed. The uncertainties quoted are the 
combined statistical and systematic uncertainties (prior 
to the application of the fitting procedure applied when 
determining cross section upper limits). The expected 
signal contribution is shown for an assumed Higgs mass 
M H = 115 GeV. 





W+2 jet 


W+2 jet 


W+3 jet 


W+3 jet 




2 fe-tag 


1 &-tag 


2 &-tag 


1 &-tag 


IT+light 57.5 ± 9.2 


1290 ± 201 


12.1 ± 1.8 


210 ± 35 


MJ 


56.5 ± 4.2 


663 ± 43 


12.7 ± 1.0 


186 ± 13 


Wbb 


346 ± 93 


1601 ± 383 


47.8 ± 12.9 


358 ± 90 


tt 


177 ± 35 


417 ± 54 


176 ± 35 


633 ± 96 


s-top 


58.3 ± 11.4 


203 ± 33 


13.0 ± 2.7 


53.6 ± 9.1 


WZ 


22.5 ± 3.3 


152.6 ± 17.6 


2.6 ± 1.1 


33.9 ± 4.8 


Total 


718 ± 120 


4326 ± 501 


264 ± 44 


1474 ± 160 


Data 


709 


4316 


301 


1463 


WH 


6.5 ± 1.0 


9.7 ± 0.9 


0.8 ± 0.2 


2.1 ± 0.3 



IX. MULTIVARIATE DISCRIMINANT 

To separate the remaining background from the sig- 
nal, a multivariate random forest (RF) discriminant tech- 
nique [4, 5] is applied independently to each of the 16 sub- 
samples, defined by categorizing events by lepton flavor 
(electron or muon), jet multiplicity (2 jets or 3 jets), b- 
tag multiplicity (single- or two-6-tagged) , and pre- and 
post-upgrade data. The RF technique employs a set 
of decision trees, each of which applies a series of con- 
secutive binary decisions trained on simulated events of 
known origin until a predefined stopping configuration is 
reached. Half of the simulated events are used for train- 
ing and validation, and the remaining half are used to 
estimate the relative contributions of signal and back- 
ground in the data. 

Each individual decision tree examines an initial input 
event training sample and applies selection criteria on a 
list of potentially discriminating variables to subdivide 
the training sample into smaller signal or background 
regions referred to as nodes. At each step, the selection 
criterion is chosen to maximize the positive cross entropy 
"figure of merit" value 



Q = —p hip — qlriq, 



(8) 



where p (q) is the fraction of correctly (incorrectly) clas- 
sified events at each stage. The process is continued until 
a pure signal or pure background node is obtained, and 
the remaining node regions can no longer be further max- 
imized and split without leaving fewer than a prespecified 
minimum number of events in the other daughter sam- 
ples. The resulting output nodes are referred to as leaves. 
For each of the subsamples, the decision tree algorithm 
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FIG. 7: [color online] Comparison of the expected backgrounds to the single-&-tagged jet data sample in W+2 jet selected 
events. The expectation for a WH signal at Mh = 115 GeV has been scaled up by a factor of 10. 



TABLE III: Description of the 20 kinematic input quantities 
provided to each random forest discriminant. 



RF input variable 



Description 



Pt(£-$t system) 
Pt(Ji) 

Pt(J2) 


ma 

Pt (dijet system) 
Afl(jij 2 ) 


H z 


A<j>(j u £) 


\/Ii = S£(i/i + I + jets^ 



\fl 2 = E_B(^2 + I + jets) 



AR(dijet,£ + vi) 



AR(di}et,£ + v 2 ) 



Aplanarity 



cos(x) 



Missing transverse energy 
Lepton-$?T transverse mass 
pr of W boson candidate 
Leading jet pr 
Subleading jet pr 
Dijet invariant mass 
Pt of dijet system 
AR between the two leading jets 
A(j> between the two leading jets 
Scalar sum of the transverse 
momenta of all jets in the event 
Scalar sum of the longitudinal 
momenta of all jets in the event 
A(j> between the leading jet 
and the lepton 
Second leading jet energy 
Center-of-mass energy of the 
i^+^+dijet system with larger 
solution for the longitudinal 
momentum of the v candidate 
Center-of-mass energy of the 
i/+i?+dijet system with smaller 
solution for the longitudinal 
momentum of the v candidate 
AR between the dijet system 
and the £ + v system with larger 
solution for the longitudinal 
momentum of the v candidate 
AR between the dijet system 
and the I + v system with smaller 
solution for the longitudinal 
momentum of the v candidate 
| A3, where A3 is the smallest 
eigenvalue of the normalized 
momentum tensor: 

M - ( ^M1 

where jj, runs over jets and 
the charged lepton and p^ is 
the ith 3-momentum component 
of the /ith physics object. 
Cosine of angle between the W 
candidate and nominal proton 
beam direction in the zero 
momentum frame (see Ref. [49]) 
Cosine of angle between lepton 
and rotated 3-momentum vector 
of the dijet system in the production 
plane of the W boson rest frame [49] 
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are assigned within each decision tree to create the for- 
est. Varying the number of input variables used by ±1 is 
found to have a negligible effect on the RF output. 

The 20 input variables used to build the RF decision 
are optimized in dedicated studies of their discriminating 
power and are listed, together with their definitions, in 
Table III. Agreement between the data and the total MC 
and data-determined background estimates are obtained 
for each input variable distribution for both the two- 6- 
tagged and single-6-tagged samples as well as for the full 
sample prior to the application of ^-tagging. The same 
set of input variables is used for the W+2 jet and W+3 
jet samples. In addition to the ten variables already dis- 
cussed in Sec. VIII, and displayed in Figs. 6 and 7, a 
further ten discriminating variables are provided to each 
RF and these are shown for the W+2 jet sample, after 
the application of two and one 6-tag requirements to the 
events, in Figs. 8 and 9, respectively. 

Two input distributions are provided for y/S and 
Ai?(dijet,£+^) corresponding to each of the two solutions 
for the longitudinal momentum component of the miss- 
ing energy vector (assuming the lepton and 1£t are decay 
products of an on-shell W boson) . The angles 6* and \ 
are included to exploit kinematic differences arising from 
the expected spin-0 nature of the Higgs and non-spin-0 
nature of the Wbb background. The angle 8* is the angle 
between the W boson candidate and the nominal proton 
beam direction in the zero momentum frame, and x 1S 
the angle between the charged decay lepton and rotated 
(production plane) three-momentum vector of the dijet 
system after boosting to the W boson rest frame [49]. 

Each RF is trained simultaneously using all simulated 
backgrounds sources (the MJ contribution is excluded) 
for each simulated Higgs mass point, and the process is 
repeated for each of the 16 subsamples. The minimum 
number of events in a leaf is tuned in separate studies and 
the number that maximizes the sensitivity is selected. 
The number of decision trees used within each forest is 
also studied and tuned using the procedure of Ref. [50] . 

The resulting RF output distributions are shown in 
Figs. 10 and 11 for the two- and single-6-tagged jet re- 
quirements in the final W+2 jet and W+3 jet samples, 
respectively. The electron and muon channel samples 
have been combined in the figures, and the preupgrade 
and postupgrade samples are also combined in the fig- 
ures. The figures show the results obtained using the 
Mh = 115 GeV signal samples. An improved separa- 
tion of simulated signal and background contributions, is 
obtained in all cases. 



is run multiple times to create the forest and variants of 
the default training sample are used for each decision tree 
within each RF. The outputs of the decision trees within 
each RF are combined to yield final RF output distri- 
butions. The decision tree samples are obtained using 
bootstrap aggregation ("bagging"), and a random subset 
of 13 of the 20 input discriminating variable distributions 



X. SYSTEMATIC UNCERTAINTIES 

The impact of each possible source of systematic un- 
certainty is assessed separately for the signal and for all 
backgrounds, for each of the 16 statistically independent 
samples, and categorized according to whether it affects 
the normalization and the shape (shape systematic) of 
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FIG. 8: Comparison of the total backgrounds to data for the additional variables provided as inputs to the random forests. 
The distributions are compared after requiring two-6-tagged jets in W+2 jet events. Each variable is defined in Table III. The 
expectation for a W H signal at Mh — 115 GeV has been scaled up by a factor of 10. 
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the RF discriminant output distributions or whether it 
only affects the normalization of signal and backgrounds. 
A full analysis is repeated after individually varying each 
source by ±1 standard deviation (s.d.) in the simula- 
tion, except where noted otherwise (the uncertainty in 
the MJ background modeling is determined separately 
from data). After each variation, the simulated and MJ 
background yields are normalized to the selected data 
samples prior to the application of 6-tagging. 

The systematic uncertainty assigned to the data- 
determined efficiency of the triggers used in the electron 
channel is (3-5)%. In the muon channel, where the full 
list of available triggers is used, a comparable uncertainty 
of (3-4)% is assigned. In the muon channel, this uncer- 
tainty arises from a normalization uncertainty of 2%, ob- 
tained after comparing results using the single high-py 
muon and the full list of triggers, and a shape systematic 
of (1-3)% as a function of jet pt, applied to the non- 
single-muon trigger efficiency. The shape systematic is 
obtained by comparison of the single high-p^ muon and 
non-single-muon triggered components of the dataset. 

The uncertainty on the identification and reconstruc- 
tion of isolated electrons, as well as their energies, af- 
fects the shapes of the electron channel RF distributions 
and is (5-6)%. In the muon channel, the uncertainty 
comprises three contributing sources: an uncertainty of 
0.8% applied to the pre-upgrade muon identification effi- 
ciency (a 1.2% uncertainty is applied to the post-upgrade 
samples, which is increased for muon px < 20 GeV by 
adding 2% in quadrature), an uncertainty in the corre- 
sponding track reconstruction of 2.3% (pre-upgrade) and 
1.4% (post-upgrade), and an uncertainty of 3.8% (pre- 
upgrade) and 0.9% (post-upgrade) on the scale factors 
used to correct the efficiencies for muons to pass isola- 
tion criteria in the MC to those measured in the data. 

Sources of systematic uncertainty on the selection and 
reconstruction of jets are the jet resolution and jet energy 
scale, as well as the jet identification efficiency and vertex 
confirmation requirement (applied to the post-upgrade 
part of the dataset). Shape uncertainties for jet reso- 
lution and jet energy scale are determined by varying 
parameters in the jet resolution function and the energy 
scale correction and repeating the analysis using the kine- 
matics of the modified jets. The size of this effect on 
the RF distribution depends on the sample and process 
and is in the range 15%-30%. The jet identification and 
vertex confirmation uncertainties are each determined by 
randomly reducing the number of jets that remain in sim- 
ulation (the +1 s.d. result is then obtained by inverting 
the —1 s.d. result). The resulting RF shape systematic 
uncertainty is about 5%. Because of low statistics after 
6-tagging for the IT+light and WZ samples, the jet sys- 
tematic uncertainties applied for these backgrounds are 
determined prior to ^-tagging. 

The uncertainty on the jet taggability requirement is 
determined by varying the jet taggability correction fac- 
tors. The taggability uncertainty affects the shapes of the 
RF output distributions and is about 3%. The RF shape 



uncertainty for the response of the ^-tagging algorithm is 
applied separately for light and heavy flavored jets and 
is typically (2.5-3.0)% for single-tagged heavy flavor jets 
and in the range (1-4)% for single-tagged light jets (the 
light-quark jet mistag probability uncertainty is of order 
10%). The RF uncertainty is approximately doubled in 
the samples requiring two 6-tagged jets. 

Uncertainties in the predicted tt 1 single top quark, 
and diboson cross sections are taken from [42, 45, 46] 
and affect the normalizations of the backgrounds. The 
uncertainty on the CTEQ6L parton density function is 
estimated following the prescription of Ref. [39]. The 
ALPGEN-generated samples include additional normaliza- 
tion factors that change their visible cross sections, and 
their uncertainties are determined separately. The un- 
certainty in the reweighting procedure applied to the 
ALPGEN-generated event samples affects the shape of the 
ALPGEN RF output distributions and are typically of the 
order 2%. The uncertainty on the ALPGEN scale factor 
^w+jcts ig 6% and the uncerta i n ty on S wbi is 20%. The 

renormalization and factorization scales used in ALPGEN 
are varied by adjusting each scale simultaneously, by fac- 
tors of 0.5 and 2.0. This affects the shapes of the ALPGEN 
RF output distributions, and the resulting uncertainty is 
of the order 2%, as is the uncertainty arising from the 
choice of value for the strong coupling constant as ■ The 
uncertainty on the MLM factorization scheme used to 
match ALPGEN partons to cone jets is propagated to the 
RF distribution and results in a systematic uncertainty 
of about 2%. 

The uncertainty in the MJ background modeling is ob- 
tained from the data. It is determined by varying the 
parameterization of the efficiency for loosely selected lep- 
tons to enter the final selected sample and by also vary- 
ing the misidentified jet probabilities. The MJ uncer- 
tainties are anticorrelated with the normalization of the 
ALPGEN samples, and this is taken into account in the 
limit setting procedure. The overall experimental sys- 
tematic uncertainty assigned to the W H distributions is 
about 6%. The uncertainty of the experimentally mea- 
sured integrated luminosity is treated separately. The 
uncertainty is 6.1% [22] and is fully correlated between 
all of the simulated background samples. 



XI. UPPER LIMITS ON THE WH CROSS 
SECTION 

No excess of events is observed with respect to the 
background estimation and upper limits are therefore de- 
rived for the WH production cross section multiplied by 
the corresponding H — > bb branching ratio in units of the 
SM prediction. The limits are calculated using the modi- 
fied frequentist CL S approach [51, 52], and the procedure 
is repeated for each assumed value of Mh ■ 

Two hypotheses are considered: the background-only 
hypothesis (B), in which only background contributions 
are present, and the signal-plus-background (S+B) hy- 
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FIG. 9: Comparison of the total backgrounds to data for the additional variables provided as inputs to the random forests. 
The distributions are compared after requiring a single-b-tagged jet in W+2 jet events. Each variable is defined in Table III. 
The expectation for a W H signal at Mh = 115 GeV has been scaled up by a factor of 10. 
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FIG. 10: [color online] Output RF distributions on linear and logarithmic scales for (a), (c) single (one-b-tag) and (b), (d) 
two-fe-tagged W+2 jet events. The expectation for signal at Mh = 115 GeV (solid black line) is scaled by a factor of 10. 



pothesis in which both signal and background contribu- 
tions are present. 

The limits are determined using the RF output distri- 
butions, together with their associated uncertainties, as 
inputs to the limit setting procedure. To preserve the 
stability of the limit derivation procedure in regions of 
small background, the width of the bin at the largest 
RF output value is adjusted by comparing the total B 
and S+B expectations until the statistical significance 
for B and S+B is, respectively, greater than « 3.6 and 
5.0 standard deviations from zero. The remaining part of 
the distribution is then divided into 23 equally-sized bins. 
The rebinning procedure is checked for potential biases 
in the determination of the final limits, and no such bias 
is found. 

The result for each hypothesis is obtained by testing 
the outcome of a large number of simulated pseudoex- 
periments. For each pseudoexperiment, pseudodata are 
drawn from the RF distributions, by randomly generat- 
ing the pseudodata according to a Poisson statistical par- 
ent distribution for which the mean is either taken from 
the background-only or signal-plus-background hypoth- 
esis. A negative Poisson log likelihood ratio (LLR) test 
statistic is used to evaluate the statistical significance of 
each experiment, with the outcomes ordered in terms of 



their statistical significance. The frequency of each out- 
come defines the shapes of the resulting LLR distribution, 
for both the background-only and signal-plus-background 
hypotheses, at each mass point. 

Systematic uncertainties are defined through nuisance 
parameters that are assigned Gaussian probability dis- 
tributions (priors). The signal and background predic- 
tions are taken to be functions of the nuisance parameters 
and each nuisance parameter is sampled from a Gaussian 
probability distribution in each pseudoexperiment. The 
correlated systematic uncertainties across channels (such 
as the uncertainties on predicted SM cross sections, iden- 
tification efficiencies, and energy calibration, as described 
in Sec. X) are also taken into account in the limit setting 
procedure [53]. 

The inclusion of systematic uncertainties in the gener- 
ation of pseudoexperiments has the effect of broadening 
the LLR distributions and, thus, reducing the ability to 
resolve signal-like excesses. This degradation can be par- 
tially reduced by performing a maximum likelihood fit 
to each pseudoexperiment (and data), once each for the 
S+B and the background-only hypotheses. The maxi- 
mization is performed over the systematic uncertainties. 
The LLR is evaluated for each outcome using the ratio 
of maximum likelihoods for the fit to each hypothesis. 
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FIG. 11: [color online] Output RF distributions on linear and logarithmic scales for (a), (c) single (one-b-tag) and (b), (d) 
two-fe-tagged W+3 jet events. The expectation for signal at Mh = 115 GeV (solid black line) is scaled by a factor of 10. 



The resulting degradation of the limits due to systematic 
uncertainties is of the order of 30%. 

The medians of the obtained background-only LLR 
distributions for each tested mass point are summa- 
rized in Fig. 12. The resulting medians of the signal- 
plus-background hypothesis LLR distributions are also 
shown. The corresponding ±lcr and ±2(7 values for the 
background-only hypothesis at each mass point are rep- 
resented by the shaded regions in the figure. The LLR 
values obtained from the data are also summarized in the 
figure. 

The RF discriminant distributions after the 
background-only profile fit are shown in Fig. 13 af- 
ter subtracting the total background expectation, for 
the Higgs boson mass points Mh = 100,115,130, and 
140 GeV. The signal expectations are shown scaled 
to the final observed upper limits (rounded to the 
nearest integer) in each case, and the uncertainties in 
the background before and after the constrained fit are 
shown by the shaded bands and solid lines, respectively. 

Upper limits are calculated at 11 discrete values of the 
Higgs boson mass, spanning the range 100-150 GeV and 
spaced in units of 5 GeV, by scaling the expected signal 
contribution to the value at which it can be excluded at 
the 95% C.L. The expected limits are calculated from 



TABLE IV: The expected and observed 95% C.L. limits from 
the likelihood fit, maximized over systematics, as a function 
of the hypothetical Higgs mass Mh- The limits are presented 
as ratios of o(pp — > W H) x BTZ(H — > bb) to the expected SM 
prediction. 

Combined 95% C.L. Limit /<tsm 



Higgs Mass [GeV] Expected 



Observed 



100 


3.3 


2.7 


105 


3.6 


4.0 


110 


4.2 


4.3 


115 


4.8 


4.5 


120 


5.6 


5.8 


125 


6.8 


6.6 


130 


8.5 


7.0 


135 


11.5 


7.6 


140 


16.5 


12.2 


145 


23.6 


15.0 


150 


36.8 


30.4 



the background-only LLR distribution whereas the ob- 
served limits are quoted with respect to the LLR values 
measured in data. The expected and observed 95% C.L. 
upper limits results for the WH cross section multiplied 
by the branching ratio H — ¥ bb are shown, as a function 
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FIG. 12: [color online] The observed LLR as a function of hypothetical Higgs boson mass. Also shown are the medians of 
the resulting LLR distributions for the background-only hypothesis (dashed line), along with the ±lu and ±2<r values (shaded 
bands), after generating multiple pseudoexperiments at each test mass point. The medians of the signal-plus-background 
hypothesis are shown by the dash-dotted line. 



of the Higgs boson mass Mh, in units of the SM predic- 
tion in Fig. 14. The values obtained for the expected and 
observed limit to SM ratios at each mass point are listed 
in Table IV (the uncertainty in the predicted WH cross 
section is available in Ref. [38]). 



XII. SUMMARY 



a factor 4.5 (4.8) larger than the SM expectation for a 
Higgs mass Mh — 115 GeV. These results, combined 
with those of Ref. [54] and with other searches in this 
mass region at the Tevatron, provide crucial constraints 
on the Higgs coupling to bb. complementary to the 
information obtained, for other decay modes, by the 
LHC experiments. 



A search for the SM associated WH production 
in data corresponding to an integrated luminosity of 
C sa 5.3 fb _1 collected with the DO detector at the 
Fermilab Tevatron pp Collider shows no excess beyond 
the expected contributions from SM backgrounds. Sta- 
tistically independent data samples containing W —>■ ev 
and W — > fj,u candidates with either two or three 
reconstructed jets in the event and subdivided into two 
6-tagged jets or a single 6-tagged jet are analyzed using 
a multivariate technique to provide separation of signal 
and background. Upper limits are calculated at the 95% 
C.L. for the WH cross section multiplied by the branch- 
ing ratio H ->• bb for the region 100 < Mh < 150 GeV. 
The observed (expected) upper limits at 95% C.L. are 
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FIG. 13: [color online] The RF discriminant output distribution minus the total background expectation for (a) Mh = 100 GeV, 
(b) Mh = 115 GeV, (c) Mh = 130 GeV, and (d) Mh — 140 GeV. The prefit uncertainties are shown by the shaded bands and 
the post-fit uncertainties are represented by the solid lines. The signal expectation is shown scaled to the obtained observed 
upper limit at each test mass point. 
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